Monotonically equivalent entropies and 
solution of additivity equation 



Pavel Gorban 

Omsk State University, Omsk, Russia 

ETH Zurich, Department of Materials, Institute of Polymers 
ETH-Zentrum, CH-8092 Zurich, Switzerland 



Abstract 

Generalized entropies are studied as Lyapunov functions for the Master equation 
(Markov chains). Three basic properties of these Lyapunov functions are taken 
into consideration: universality (independence of the kinetic coefficients), trace-form 
(the form of a sum over the states), and additivity (for composition of independent 
subsystems). All entropies that have all three properties simultaneously and 
are defined for positive probabilities are found. They form a one-parametric family. 

We also consider pairs of entropies $S_1$, $S_2$ that are connected by a monotonic 
transformation $S_2 = F(S_1)$ (equivalent entropies). All classes of pairs of universal 
equivalent entropies, one of which has the trace-form and the other of which is additive 
(these entropies can differ from one another), are found. These classes consist of 
two one-parametric families: the family of entropies that are equivalent to the 
additive trace-form entropies, and the family of Renyi-Tsallis entropies. 



1 Introduction 



Interest in non-classical entropies has persisted for many decades [1], [2], [3]. 
Some authors have even compiled tables of entropies [4]. This interest is supported 
by the successes of "nonextensive statistical mechanics" in the description of 
different phenomena [6], [7]. 

There are many possible frameworks for the consideration of entropies. One 
of them is fixed by the following condition: the entropy of an isolated system must 
grow monotonically with time for every physical kinetics. In this work we 
investigate different entropies as Lyapunov functions for Markov chains [8], 
[9], [10]. 



The basic model we consider here is a finite Markov chain (finiteness and 
discreteness are by no means principal restrictions, and are employed only in 
order to avoid convergence questions). The time evolution of the state probabilities 
$p_i$, where $i$ is the discrete label of the state, is given by the Master equation 

$$\dot{p}_i = \sum_{j,\, j \neq i} q_{ij} p_j^* \left( \frac{p_j}{p_j^*} - \frac{p_i}{p_i^*} \right) \qquad (1)$$

where $p_i^*$ are the equilibrium probabilities and the coefficients $q_{ij}$ must satisfy 
the following condition 

$$\sum_{j,\, j \neq i} q_{ij} p_j^* = \sum_{j,\, j \neq i} q_{ji} p_i^* \qquad (2)$$

We consider only systems that allow a positive equilibrium, $p_i^* > 0$ (for 
infinite systems, it is often advantageous to use unnormalized $p_i^*$). 

A function $H(p, p^*)$ ($p = (p_i)$, $p^* = (p_i^*)$) is a Lyapunov function for the 
Markov chain (1) if its time derivative along the solutions of (1) is non-positive 
($\dot{H} \leq 0$). 

We will consider three important properties of Lyapunov functions $H(p, p^*)$: 

1) Universality: $H$ is a Lyapunov function for Markov chains (1) with a given 
equilibrium $p^*$ for all possible values of the kinetic coefficients $k_{ij} \geq 0$. 

2) $H$ is a trace-form function: 

$$H(p, p^*) = \sum_i f(p_i, p_i^*) \qquad (3)$$

where $f$ is a differentiable function of two variables. 

3) $H$ is additive for the composition of independent subsystems. It means that if 
$p = (p_{ij}) = (q_i r_j)$ and $p^* = (p_{ij}^*) = (q_i^* r_j^*)$, then $H(p) = H(q) + H(r)$. 

Here and further we suppose $0 < p_i, p_i^*, q_i, q_i^*, r_i, r_i^* < 1$. 
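
The Lyapunov property can be illustrated numerically. The following is a minimal sketch, not part of the original argument: it assumes detailed balance (a symmetric matrix `K[i][j]` standing for $q_{ij} p_j^*$), which is only one convenient way to satisfy the balance condition, and it uses an explicit Euler step. The helper names `make_rates`, `rhs`, `H` and `simulate` are hypothetical.

```python
import math
import random

def make_rates(p_star, rng):
    """Return K[i][j] = q_ij * p_j^*, chosen symmetric so that
    q_ij p_j^* = q_ji p_i^* (detailed balance, assumed for simplicity)."""
    n = len(p_star)
    K = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            K[i][j] = K[j][i] = rng.uniform(0.1, 1.0)
    return K

def rhs(p, p_star, K):
    """Right-hand side of the Master equation (1)."""
    n = len(p)
    return [sum(K[i][j] * (p[j] / p_star[j] - p[i] / p_star[i])
                for j in range(n) if j != i) for i in range(n)]

def H(p, p_star, h):
    """Trace-form function (3) with f(p, p*) = p* h(p / p*)."""
    return sum(ps * h(pi / ps) for pi, ps in zip(p, p_star))

def simulate(p0, p_star, K, h, dt=1e-3, steps=5000):
    """Explicit Euler integration; returns the values of H along the path."""
    p, values = list(p0), []
    for _ in range(steps):
        values.append(H(p, p_star, h))
        dp = rhs(p, p_star, K)
        p = [pi + dt * d for pi, d in zip(p, dp)]
    return values

rng = random.Random(0)
p_star = [0.5, 0.3, 0.2]
K = make_rates(p_star, rng)
h = lambda x: x * math.log(x)   # Boltzmann-Gibbs-Shannon case
values = simulate([0.1, 0.2, 0.7], p_star, K, h)
monotone = all(b <= a + 1e-12 for a, b in zip(values, values[1:]))
print(monotone, values[0], values[-1])
```

With the small step used here each Euler update is itself a stochastic map with stationary vector $p^*$, so the recorded values of $H$ should not increase for any convex $h$.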

In the next sections we consider the additivity condition as a functional equation 
and solve it. In Section 2 we describe all Lyapunov functions 
that have all three of these properties simultaneously, and in Section 3 
we find all non-additive trace-form entropies that become additive after a 
monotonic transformation of the entropy scale. 



2 Non-classic additive entropies 



The following theorem describes all Lyapunov functions for Markov chains, 
which have all three properties 1) - 3) simultaneously. 

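Let $f(p, p^*)$ be a twice differentiable function of two variables. 

Theorem 1. If a function $H(p, p^*)$ has all the properties 1)-3) simultaneously, 
then 

$$f(p, p^*) = p^* h\left(\frac{p}{p^*}\right), \quad H(p, p^*) = \sum_i p_i^* h\left(\frac{p_i}{p_i^*}\right) \qquad (4)$$

where 

$$h(x) = C_1 \ln x + C_2 x \ln x, \quad C_1 \leq 0, \quad C_2 \geq 0 \qquad (5)$$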
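
As a quick numerical illustration of the additivity property for the family (4), (5), the sketch below compares $H$ on a product distribution with the sum over the two factors. The constants, the sample distributions, and the helper names `make_H`, `product` and `additivity_gap` are assumptions chosen for the example. A convex $h$ outside the family (5) is included for contrast.

```python
import math

def make_H(h):
    """Trace-form function H(p, p*) = sum_i p_i^* h(p_i / p_i^*), as in (4)."""
    return lambda p, p_star: sum(ps * h(pi / ps) for pi, ps in zip(p, p_star))

def product(a, b):
    """Joint distribution p_ij = a_i b_j of two independent subsystems."""
    return [ai * bj for ai in a for bj in b]

def additivity_gap(H, q, q_star, r, r_star):
    """|H(p) - H(q) - H(r)| for p = q x r and p* = q* x r*."""
    joint = H(product(q, r), product(q_star, r_star))
    return abs(joint - H(q, q_star) - H(r, r_star))

C1, C2 = -0.5, 2.0   # sample constants with C1 <= 0 <= C2
H_additive = make_H(lambda x: C1 * math.log(x) + C2 * x * math.log(x))
H_quadratic = make_H(lambda x: (x - 1.0) ** 2)   # convex, but not of the form (5)

q, q_star = [0.2, 0.5, 0.3], [0.3, 0.3, 0.4]
r, r_star = [0.6, 0.1, 0.3], [0.25, 0.25, 0.5]
gap_additive = additivity_gap(H_additive, q, q_star, r, r_star)
gap_quadratic = additivity_gap(H_quadratic, q, q_star, r, r_star)
print(gap_additive < 1e-12, gap_quadratic > 1e-3)
```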

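Lemma 1. If a Lyapunov function $H(p)$ for the Markov chain is of the trace-form 
($H(p) = \sum_i f(p_i, p_i^*)$) and is universal, then $f(p, p^*) = p^* h\left(\frac{p}{p^*}\right)$, where 
$h(x)$ is a convex function of one variable. 

Proof. Let us consider a Markov chain with two states. For such a chain 
$\dot{p}_1 = q_{12} p_2^* \left( \frac{p_2}{p_2^*} - \frac{p_1}{p_1^*} \right) = -q_{21} p_1^* \left( \frac{p_1}{p_1^*} - \frac{p_2}{p_2^*} \right) = -\dot{p}_2$ 
(here $q_{12} p_2^* = q_{21} p_1^*$). Since $\dot{H} \leq 0$ is required 
for $H$ to be a Lyapunov function, we get the inequality 

$$\left( \frac{\partial f(p_1, p_1^*)}{\partial p_1} - \frac{\partial f(p_2, p_2^*)}{\partial p_2} \right) \left( \frac{p_2}{p_2^*} - \frac{p_1}{p_1^*} \right) \leq 0,$$

from which we can see that $\frac{\partial f(p, p^*)}{\partial p}$ is a monotonic function of $\frac{p}{p^*}$, 
so $f(p, p^*) = p^* h\left(\frac{p}{p^*}\right)$ (up to an additive term that does not depend on $p$), 
where $h$ is a convex function of one variable. 
Lemma 1 is proven. 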
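
The role of convexity in the two-state argument can be seen in a small numerical sketch. It assumes $H = \sum_i p_i^* h(p_i / p_i^*)$ and a single rate constant `k` standing for $q_{12} p_2^* = q_{21} p_1^*$. The function name `h_dot` and the grid are illustrative choices, not taken from the paper.

```python
import math

def h_dot(p1, p1_star, dh, k=1.0):
    """dH/dt for a two-state chain with H = sum_i p_i^* h(p_i / p_i^*).
    dh is the derivative h'; k = q_12 p_2^* = q_21 p_1^* > 0."""
    p2, p2_star = 1.0 - p1, 1.0 - p1_star
    x1, x2 = p1 / p1_star, p2 / p2_star
    p1_dot = k * (x2 - x1)              # Master equation for state 1
    return (dh(x1) - dh(x2)) * p1_dot   # uses dp_2/dt = -dp_1/dt

grid = [i / 20 for i in range(1, 20)]
convex = lambda x: math.log(x) + 1.0       # h(x) = x ln x, h' increasing
concave = lambda x: -(math.log(x) + 1.0)   # h(x) = -x ln x, h' decreasing

ok_convex = all(h_dot(p, s, convex) <= 1e-15 for p in grid for s in grid)
ok_concave = all(h_dot(p, s, concave) <= 1e-15 for p in grid for s in grid)
print(ok_convex, ok_concave)
```

For the convex choice the derivative is non-positive at every grid point, while for the concave one it becomes positive as soon as $p_1 / p_1^* \neq p_2 / p_2^*$.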

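Proof of Theorem 1. Let $H$ have the form (4) and let $h$ be twice differentiable 
in the interval $(0, +\infty)$. Then the additivity equation 

$$H(p) - H(q) - H(r) = 0 \qquad (6)$$

holds. Here (in (6)) 

$$q_n = 1 - \sum_{i=1}^{n-1} q_i, \quad r_m = 1 - \sum_{j=1}^{m-1} r_j,$$

$$H(p) = \sum_{i,j} q_i^* r_j^* h\left(\frac{q_i r_j}{q_i^* r_j^*}\right), \quad H(q) = \sum_i q_i^* h\left(\frac{q_i}{q_i^*}\right), \quad H(r) = \sum_j r_j^* h\left(\frac{r_j}{r_j^*}\right), \quad p = (p_{ij}) = (q_i r_j).$$

Let us differentiate this equation first with respect to $q_1$ and then with respect to $r_1$. Then we 
get the equation (with $g(x) = h'(x)$) 

$$\begin{aligned}
& g\left(\frac{q_1 r_1}{q_1^* r_1^*}\right) - g\left(\frac{q_n r_1}{q_n^* r_1^*}\right) - g\left(\frac{q_1 r_m}{q_1^* r_m^*}\right) + g\left(\frac{q_n r_m}{q_n^* r_m^*}\right) \\
& + \frac{q_1 r_1}{q_1^* r_1^*} g'\left(\frac{q_1 r_1}{q_1^* r_1^*}\right) - \frac{q_n r_1}{q_n^* r_1^*} g'\left(\frac{q_n r_1}{q_n^* r_1^*}\right) - \frac{q_1 r_m}{q_1^* r_m^*} g'\left(\frac{q_1 r_m}{q_1^* r_m^*}\right) + \frac{q_n r_m}{q_n^* r_m^*} g'\left(\frac{q_n r_m}{q_n^* r_m^*}\right) = 0
\end{aligned}$$

Let us denote $x = \frac{q_n r_m}{q_n^* r_m^*}$, $y = \frac{q_n r_1}{q_n^* r_1^*}$, $z = \frac{q_1 r_m}{q_1^* r_m^*}$ and $\psi(x) = g(x) + x g'(x)$, 
so that $\frac{q_1 r_1}{q_1^* r_1^*} = \frac{yz}{x}$. It is 
obvious that if $n$ and $m$ are greater than 2, then $x$, $y$ and $z$ are independent and 
can take any positive values. So we get the functional equation: 

$$\psi\left(\frac{yz}{x}\right) = \psi(y) + \psi(z) - \psi(x) \qquad (7)$$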



Let us denote $C_2 = \psi(1)$ and $\psi_1(\alpha) = \psi(\alpha) - \psi(1)$, and take $x = 1$. We then get 

$$\psi_1(yz) = \psi_1(y) + \psi_1(z),$$

the Cauchy functional equation. The solution of this equation in the class 
of measurable functions is $\psi_1(\alpha) = C_1 \ln \alpha$, where $C_1$ is a constant. So we get 
$\psi(x) = C_1 \ln x + C_2$ and $g(x) + x g'(x) = C_1 \ln x + C_2$. The solution is 
$g(x) = \frac{C_3}{x} + C_1 \ln x + C_2 - C_1$; 
$h(x) = \int \left( \frac{C_3}{x} + C_1 \ln x + C_2 - C_1 \right) dx = C_3 \ln x + C_1 x \ln x + (C_2 - 2 C_1) x + C_4$, 
or, renaming constants, $h(x) = C_1 \ln x + C_2 x \ln x + C_3 x + C_4$. 
In the expression for $h(x)$ there are two parasitic constants $C_3$ and $C_4$, which 
occur because the initial equation was differentiated twice. So $C_3 = 0$, $C_4 = 0$ 
and $h(x) = C_1 \ln x + C_2 x \ln x$. Because $h$ is convex, we have $C_1 \leq 0$ and $C_2 \geq 0$. 

Theorem 1 is proven. 
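
The two computational steps of this proof can be spot-checked numerically. The sketch below reads the functional equation (7) as $\psi(yz/x) = \psi(y) + \psi(z) - \psi(x)$ and checks that $\psi(x) = C_1 \ln x + C_2$ satisfies it, and that $g(x) = C_3/x + C_1 \ln x + C_2 - C_1$ solves $g + x g' = \psi$. The constant values and the finite-difference helper `dg` are assumptions made for the illustration.

```python
import math
import random

C1, C2, C3 = -0.7, 1.3, 0.4   # arbitrary sample constants (assumptions)

def psi(x):
    """psi(x) = C1 ln x + C2, the measurable solution of (7)."""
    return C1 * math.log(x) + C2

def g(x):
    """g(x) = C3/x + C1 ln x + C2 - C1, claimed to solve g + x g' = psi."""
    return C3 / x + C1 * math.log(x) + C2 - C1

def dg(x, eps=1e-6):
    """Central finite-difference approximation of g'(x)."""
    return (g(x + eps) - g(x - eps)) / (2.0 * eps)

rng = random.Random(1)
max_eq7, max_ode = 0.0, 0.0
for _ in range(1000):
    x, y, z = (rng.uniform(0.2, 5.0) for _ in range(3))
    # Functional equation (7): psi(yz/x) = psi(y) + psi(z) - psi(x)
    max_eq7 = max(max_eq7, abs(psi(y * z / x) - psi(y) - psi(z) + psi(x)))
    # First-order ODE: g(x) + x g'(x) = psi(x)
    max_ode = max(max_ode, abs(g(x) + x * dg(x) - psi(x)))
print(max_eq7 < 1e-12, max_ode < 1e-6)
```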

In the case when $C_1 = 0$, the function $h(x)$ is continuous in the interval 
$[0, +\infty)$. When $C_1 \neq 0$, the function $h(x)$ is defined only in the open interval 
$(0, +\infty)$. For this reason one can exclude the "Burg part" 
of the entropy $H(p) = \sum_i p_i^* \left( C_1 \ln \frac{p_i}{p_i^*} + C_2 \frac{p_i}{p_i^*} \ln \frac{p_i}{p_i^*} \right)$ under the assumption that 
$H(p)$ is continuous when some of the $p_i$ are equal to zero [2]. 

There is one more entropy, called the Hartley entropy, which can be written as 
$S(p) = \sum_i \mathrm{sign}(p_i)$, where $\mathrm{sign}(x) = 1$ if $x > 0$ and $\mathrm{sign}(x) = 0$ if $x \leq 0$. 
It is additive and it is of the trace-form, but we have not found it because it is 
constant when $p_i > 0$. 

The family of Lyapunov functions (4), (5) was introduced for the first time in 
the book by A. Gorban [10]. The applicability of this entropy to the universal 
description of some systems outside the thermodynamic limit was considered in 
[9]. The part $C_1 \ln x$ corresponds to the Burg entropy [11]; the part 
$C_2 x \ln x$ corresponds to the classical Boltzmann-Gibbs-Shannon entropy. 



3 Trace-form entropies, which become additive after the monotonous 
transformation 

There is a further class of very useful entropies $S(p)$, which are of the trace-form 
but not additive, and which after some monotonic transformation $S_1(p) = F(S(p))$ 
become additive but lose their trace-form (such entropies $S(p)$ and 
$S_1(p)$ are in fact equivalent from the thermodynamic point of view, because the 
sets of isoentropic levels coincide). One well-known example of such a pair of 
entropies is ($q > 0$) 

$$S(p) = S_q^{T}(p) = \frac{1}{1 - q} \left( \sum_{i=1}^{n} p_i^q - 1 \right) \quad \text{(Tsallis entropy) and}$$

$$S_1(p) = S_q^{R}(p) = \frac{1}{1 - q} \ln \sum_{i=1}^{n} p_i^q \quad \text{(Renyi entropy).}$$

Here $S_q^{R}(p) = \frac{1}{1 - q} \ln \left( (1 - q) S_q^{T}(p) + 1 \right)$. 
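
A short numerical sketch of this pair: it evaluates both entropies on a product distribution, checks the monotone link between them, and compares additivity. The standard forms $(\sum_i p_i^q - 1)/(1 - q)$ and $\ln(\sum_i p_i^q)/(1 - q)$ are assumed, and the value $q = 2$ and the sample distributions are arbitrary.

```python
import math

def tsallis(p, q):
    """Tsallis entropy (sum_i p_i^q - 1) / (1 - q), q != 1."""
    return (sum(pi ** q for pi in p) - 1.0) / (1.0 - q)

def renyi(p, q):
    """Renyi entropy ln(sum_i p_i^q) / (1 - q), q != 1."""
    return math.log(sum(pi ** q for pi in p)) / (1.0 - q)

def product(a, b):
    """Joint distribution of two independent subsystems."""
    return [ai * bj for ai in a for bj in b]

q = 2.0
a, b = [0.2, 0.5, 0.3], [0.6, 0.4]
ab = product(a, b)

# Monotone link: Renyi = ln((1 - q) * Tsallis + 1) / (1 - q)
link_gap = abs(renyi(ab, q) - math.log((1 - q) * tsallis(ab, q) + 1) / (1 - q))
renyi_gap = abs(renyi(ab, q) - renyi(a, q) - renyi(b, q))
tsallis_gap = abs(tsallis(ab, q) - tsallis(a, q) - tsallis(b, q))
print(link_gap < 1e-12, renyi_gap < 1e-12, tsallis_gap > 1e-3)
```

The Renyi entropy is additive on the product while the Tsallis entropy is not, although each is a monotone function of the other.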

So there is the problem of finding all the entropies with such properties. We 
give the answer here. 

The relation between the Renyi entropy and the Tsallis entropy has been discussed many 
times. For example, S. Abe wrote: "even starting with a nonextensive entropy, 
the thermodynamic entropy appearing at the macroscopic level has to be extensive, 
in accordance with Carathéodory's theorem. This fact can be observed 
as transmutation from nonextensive Tsallis theory to extensive Renyi-entropy-based 
theory" [12]. 

If two entropies $S_1(p)$ and $S_2(p)$ are connected by a monotonic transformation 

$$S_2(p) = F(S_1(p)), \quad S_1(p) = F^{-1}(S_2(p)) \qquad (9)$$

then the solutions of the following two conditional equilibrium problems coincide: 

$$S_1(p) \to \max, \quad m(p) = M$$

$$S_2(p) \to \max, \quad m(p) = M \qquad (10)$$

Here $M$ are "macroscopic variables". Let us denote the solution of (10) by $p_M$ 
and $S_{1,2}(M) = S_{1,2}(p_M)$. The connection (9) obviously holds for these "macroscopic" 
entropies as well: 

$$S_2(M) = F(S_1(p_M)), \quad S_1(M) = F^{-1}(S_2(p_M)) \qquad (11)$$



If entropies are connected by a monotonic transformation, then they are 
equivalent for the solution of the conditional equilibrium problems (10). From 
this point of view, nonextensive Tsallis thermodynamics is equivalent to 
extensive Renyi thermodynamics (a discussion of the sources of "extensivity of 
nonextensive thermodynamics" can also be found in the recent paper [13]). If 
one is interested not only in the quasiequilibrium distributions (10), but also in 
the numerical values of the entropies, then the differences between equivalent (9) 
entropies can be essential. For example, the differences between the Renyi and Tsallis 
entropies show up when one considers the stability of entropies in the 
thermodynamic limit [14]. Nevertheless, classes of monotonically equivalent 
entropies (9) in many respects act as one single entropy. 
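
The coincidence of the conditional maximizers can be illustrated on a toy problem. The sketch below is an assumption-laden example rather than anything from the paper: it takes three states with values 0, 1, 2, uses the mean value as the hypothetical macroscopic variable $m(p)$, scans a grid of admissible distributions, and compares the Tsallis and Renyi maximizers.

```python
import math

def tsallis(p, q):
    """Tsallis entropy (sum_i p_i^q - 1) / (1 - q), q != 1."""
    return (sum(pi ** q for pi in p) - 1.0) / (1.0 - q)

def renyi(p, q):
    """Renyi entropy ln(sum_i p_i^q) / (1 - q), q != 1."""
    return math.log(sum(pi ** q for pi in p)) / (1.0 - q)

def constrained_family(M, n=2001):
    """Distributions on states {0, 1, 2} with mean value m(p) = M,
    parametrized by t = p_2 (a hypothetical toy constraint)."""
    t_min, t_max = max(0.0, M - 1.0), M / 2.0
    for k in range(1, n):
        t = t_min + (t_max - t_min) * k / n
        yield (1.0 - M + t, M - 2.0 * t, t)

q, M = 1.5, 0.8
family = list(constrained_family(M))
best_t = max(family, key=lambda p: tsallis(p, q))
best_r = max(family, key=lambda p: renyi(p, q))
print(abs(best_t[2] - best_r[2]) < 1e-3)
```

Both entropies are decreasing functions of $\sum_i p_i^q$ for $q > 1$, so the two grid searches should select the same point up to the grid resolution.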

We will look for all the classes of monotonically equivalent entropies, in which 
there is at least one trace-form entropy and one additive entropy. As in the 
previous part of the work, we consider non-increasing Lyapunov functions for 
Markov chains (minus-entropies). 

The problem is to find all universal trace-form Lyapunov functions 
$H$ for Markov chains for which there exists a monotonic function $F$ such that 
$F(H(p)) = F(H(q)) + F(H(r))$ if $p = (p_{ij}) = (q_i r_j)$. 

With Lemma 1 we get that 

$$H(p) = \sum_{i,j} q_i^* r_j^* h\left(\frac{q_i r_j}{q_i^* r_j^*}\right), \quad H(q) = \sum_i q_i^* h\left(\frac{q_i}{q_i^*}\right), \quad H(r) = \sum_j r_j^* h\left(\frac{r_j}{r_j^*}\right).$$

Let $F(x)$ and $h(x)$ be differentiable as many times as 
needed. Differentiating the equality $F(H(p)) = F(H(q)) + F(H(r))$ with respect to $r_1$ and 
$q_1$, and taking into account that $q_n = 1 - \sum_{i=1}^{n-1} q_i$ and $r_m = 1 - \sum_{j=1}^{m-1} r_j$, we get that 
$F'(H(p)) H''_{q_1 r_1}(p) = -F''(H(p)) H'_{q_1}(p) H'_{r_1}(p)$, or, if $-\frac{F''(H(p))}{F'(H(p))} = G(H(p))$, 
then 

$$G(H(p)) = \frac{H''_{q_1 r_1}(p)}{H'_{q_1}(p) H'_{r_1}(p)} \qquad (12)$$



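It is possible if and only if every linear differential operator of the first order 
that annuls $H(p)$ and $\sum_i p_i$ also annuls 

$$\frac{H''_{q_1 r_1}(p)}{H'_{q_1}(p) H'_{r_1}(p)} \qquad (13)$$

and it means that every differential operator which has the form 

$$\left( \frac{\partial H(p)}{\partial q_\gamma} - \frac{\partial H(p)}{\partial q_\alpha} \right) \frac{\partial}{\partial q_\beta} + \left( \frac{\partial H(p)}{\partial q_\beta} - \frac{\partial H(p)}{\partial q_\gamma} \right) \frac{\partial}{\partial q_\alpha} + \left( \frac{\partial H(p)}{\partial q_\alpha} - \frac{\partial H(p)}{\partial q_\beta} \right) \frac{\partial}{\partial q_\gamma} \qquad (14)$$

annuls (13). For $\beta = 2$, $\alpha = 3$, $\gamma = 4$ we get the following equation 

$$\begin{aligned}
& F_1(q, r) \left[ h'\left(\frac{q_2 r_1}{q_2^* r_1^*}\right) - h'\left(\frac{q_2 r_m}{q_2^* r_m^*}\right) + \frac{q_2 r_1}{q_2^* r_1^*} h''\left(\frac{q_2 r_1}{q_2^* r_1^*}\right) - \frac{q_2 r_m}{q_2^* r_m^*} h''\left(\frac{q_2 r_m}{q_2^* r_m^*}\right) \right] + \\
& F_2(q, r) \left[ h'\left(\frac{q_3 r_1}{q_3^* r_1^*}\right) - h'\left(\frac{q_3 r_m}{q_3^* r_m^*}\right) + \frac{q_3 r_1}{q_3^* r_1^*} h''\left(\frac{q_3 r_1}{q_3^* r_1^*}\right) - \frac{q_3 r_m}{q_3^* r_m^*} h''\left(\frac{q_3 r_m}{q_3^* r_m^*}\right) \right] + \\
& F_3(q, r) \left[ h'\left(\frac{q_4 r_1}{q_4^* r_1^*}\right) - h'\left(\frac{q_4 r_m}{q_4^* r_m^*}\right) + \frac{q_4 r_1}{q_4^* r_1^*} h''\left(\frac{q_4 r_1}{q_4^* r_1^*}\right) - \frac{q_4 r_m}{q_4^* r_m^*} h''\left(\frac{q_4 r_m}{q_4^* r_m^*}\right) \right] = 0
\end{aligned} \qquad (15)$$

where 

$$F_1(q, r) = \sum_j r_j \left[ h'\left(\frac{q_4 r_j}{q_4^* r_j^*}\right) - h'\left(\frac{q_3 r_j}{q_3^* r_j^*}\right) \right],$$

$$F_2(q, r) = \sum_j r_j \left[ h'\left(\frac{q_2 r_j}{q_2^* r_j^*}\right) - h'\left(\frac{q_4 r_j}{q_4^* r_j^*}\right) \right],$$

$$F_3(q, r) = \sum_j r_j \left[ h'\left(\frac{q_3 r_j}{q_3^* r_j^*}\right) - h'\left(\frac{q_2 r_j}{q_2^* r_j^*}\right) \right].$$
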

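If we apply the differential operator $\frac{\partial}{\partial r_2} - \frac{\partial}{\partial r_3}$, which annuls our conservation 
law $\sum_j r_j = 1$, to the left part of (15), and denote $f(x) = x h''(x) + h'(x)$, 
$x_1 = \frac{q_2}{q_2^*}$, $x_2 = \frac{q_3}{q_3^*}$, $x_3 = \frac{q_4}{q_4^*}$, $y_1 = \frac{r_1}{r_1^*}$, $y_2 = \frac{r_m}{r_m^*}$, $y_3 = \frac{r_2}{r_2^*}$, $y_4 = \frac{r_3}{r_3^*}$, we get the 
equation 

$$\begin{aligned}
& (f(x_3 y_3) - f(x_2 y_3) - f(x_3 y_4) + f(x_2 y_4)) (f(x_1 y_1) - f(x_1 y_2)) + \\
& (f(x_1 y_3) - f(x_3 y_3) - f(x_1 y_4) + f(x_3 y_4)) (f(x_2 y_1) - f(x_2 y_2)) + \\
& (f(x_2 y_3) - f(x_1 y_3) - f(x_2 y_4) + f(x_1 y_4)) (f(x_3 y_1) - f(x_3 y_2)) = 0
\end{aligned} \qquad (16)$$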

or, after differentiation with respect to $y_1$ and $y_3$ and denoting $g(x) = f'(x)$, 

$$\begin{aligned}
& x_1 g(x_1 y_1) (x_3 g(x_3 y_3) - x_2 g(x_2 y_3)) + x_2 g(x_2 y_1) (x_1 g(x_1 y_3) - x_3 g(x_3 y_3)) \\
& + x_3 g(x_3 y_1) (x_2 g(x_2 y_3) - x_1 g(x_1 y_3)) = 0
\end{aligned} \qquad (17)$$

If $y_3 = 1$, $y_1 \neq 0$, $\varphi(x) = x g(x)$, we get after multiplication of (17) by $y_1$ 

$$\varphi(x_1 y_1) (\varphi(x_3) - \varphi(x_2)) + \varphi(x_2 y_1) (\varphi(x_1) - \varphi(x_3)) + \varphi(x_3 y_1) (\varphi(x_2) - \varphi(x_1)) = 0 \qquad (18)$$



It implies that for every three positive numbers $\alpha$, $\beta$, $\gamma$ the functions $\varphi(\alpha x)$, 
$\varphi(\beta x)$, $\varphi(\gamma x)$ are linearly dependent, and for $\varphi(x)$ the differential equation 

$$a x^2 \varphi''(x) + b x \varphi'(x) + c \varphi(x) = 0$$

holds. This differential equation has solutions of two kinds: 

1) $\varphi(x) = C_1 x^{k_1} + C_2 x^{k_2}$, $k_1 \neq k_2$, where $k_1$ and $k_2$ are real or complex-conjugate 
numbers. 

2) $\varphi(x) = C_1 x^k + C_2 x^k \ln x$. 

Let us check which of these solutions satisfy equation (18). 

1) $\varphi(x) = C_1 x^{k_1} + C_2 x^{k_2}$. After substitution of this into (18) and calculations 
we get $C_1 C_2 (y_1^{k_1} - y_1^{k_2}) (x_1^{k_1} x_3^{k_2} - x_1^{k_1} x_2^{k_2} + x_1^{k_2} x_2^{k_1} - x_2^{k_1} x_3^{k_2} + x_2^{k_2} x_3^{k_1} - x_1^{k_2} x_3^{k_1}) = 0$. 
It means that $C_1 = 0$, or $C_2 = 0$, or $k_1 = 0$, or $k_2 = 0$, and the solution of this 
kind can have only the form $\varphi(x) = C_1 x^k + C_2$. 

2) $\varphi(x) = C_1 x^k + C_2 x^k \ln x$. After substitution of this into (18) and some 
calculations, if $y_1 \neq 1$, we get $C_2^2 ((x_1^k - x_2^k) x_3^k \ln x_3 + (x_3^k - x_1^k) x_2^k \ln x_2 + (x_2^k - x_3^k) x_1^k \ln x_1) = 0$. 
It means that either $C_2 = 0$ and the solution is $\varphi(x) = C_1 x^k$, or 
$k = 0$ and the solution is $\varphi(x) = C_1 + C_2 \ln x$. 

So, the equation (18) has two kinds of solutions: 

1) $\varphi(x) = C_1 x^k + C_2$ 

2) $\varphi(x) = C_1 + C_2 \ln x$. 
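
These two solution families can be checked against equation (18) numerically. In the sketch below, (18) is read as $\varphi(x_1 y_1)(\varphi(x_3) - \varphi(x_2)) + \varphi(x_2 y_1)(\varphi(x_1) - \varphi(x_3)) + \varphi(x_3 y_1)(\varphi(x_2) - \varphi(x_1)) = 0$. The sample exponents and constants are arbitrary, and two rejected candidates are included for contrast.

```python
import math
import random

def residual(phi, x1, x2, x3, y1):
    """Left-hand side of equation (18) for a candidate function phi."""
    return (phi(x1 * y1) * (phi(x3) - phi(x2))
            + phi(x2 * y1) * (phi(x1) - phi(x3))
            + phi(x3 * y1) * (phi(x2) - phi(x1)))

def worst(phi, trials=2000, seed=2):
    """Largest |residual| over random positive arguments."""
    rng = random.Random(seed)
    return max(abs(residual(phi, *(rng.uniform(0.3, 3.0) for _ in range(4))))
               for _ in range(trials))

power = lambda x: 0.8 * x ** 1.7 + 0.5          # C1 x^k + C2
logarithm = lambda x: 0.8 + 0.5 * math.log(x)   # C1 + C2 ln x
two_powers = lambda x: x ** 1.7 + x ** 0.6      # k1 and k2 both nonzero
log_power = lambda x: x ** 1.7 * math.log(x)    # x^k ln x with k != 0

print(worst(power) < 1e-9, worst(logarithm) < 1e-9,
      worst(two_powers) > 1e-3, worst(log_power) > 1e-3)
```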

Let us solve the equation $f(x) = x h''(x) + h'(x)$ for each of these two cases. 

1) $\varphi(x) = C_1 x^k + C_2$; $g(x) = C_1 x^{k-1} + \frac{C_2}{x}$; there are two possibilities. 

1.1) $k = 0$. Then $g(x) = \frac{C}{x}$, $f(x) = C \ln x + C_1$, $h(x) = C_1 x \ln x + C_2 \ln x + C_3 x + C_4$. 

1.2) $k \neq 0$. Then $f(x) = C x^k + C_1 \ln x + C_2$, and here there are also two possibilities. 

1.2.1) $k = -1$. Then $h(x) = C_1 \ln^2 x + C_2 x \ln x + C_3 \ln x + C_4 x + C_5$. 

1.2.2) $k \neq -1$. Then $h(x) = C_1 x^{k+1} + C_2 x \ln x + C_3 \ln x + C_4 x + C_5$. 

2) $\varphi(x) = C_1 + C_2 \ln x$; $g(x) = \frac{C_1}{x} + \frac{C_2 \ln x}{x}$; $f(x) = C_1 \ln^2 x + C_2 \ln x + C_3$; 

$h(x) = C_1 x \ln^2 x + C_2 x \ln x + C_3 \ln x + C_4 x + C_5$. 

(We have renamed constants during the calculations.) 
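
The building blocks of these antiderivatives can be spot-checked by applying the operator $h \mapsto x h'' + h'$ numerically. The sketch below uses central finite differences. The helper `f_of`, the exponent `k = 0.5`, and the grid are illustrative assumptions.

```python
import math

def f_of(h, x, eps=1e-4):
    """Numerical value of x h''(x) + h'(x) via central differences."""
    d1 = (h(x + eps) - h(x - eps)) / (2.0 * eps)
    d2 = (h(x + eps) - 2.0 * h(x) + h(x - eps)) / eps ** 2
    return x * d2 + d1

k = 0.5
cases = {
    # name: (h(x), expected f(x) = x h''(x) + h'(x))
    "x^(k+1)": (lambda x: x ** (k + 1), lambda x: (k + 1) ** 2 * x ** k),
    "x ln x": (lambda x: x * math.log(x), lambda x: math.log(x) + 2.0),
    "ln^2 x": (lambda x: math.log(x) ** 2, lambda x: 2.0 / x),
    "x ln^2 x": (lambda x: x * math.log(x) ** 2,
                 lambda x: math.log(x) ** 2 + 4.0 * math.log(x) + 2.0),
}

xs = [0.5 + 0.25 * i for i in range(11)]
errors = {name: max(abs(f_of(h, x) - f(x)) for x in xs)
          for name, (h, f) in cases.items()}
print(all(err < 1e-5 for err in errors.values()))
```

Each term maps to the expected kind of $f$: a power $x^k$, a logarithm, $1/x$ (the case $k = -1$), and a squared logarithm.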

For the next step let us see which of these solutions remain valid for equation 
(15). The result is that there are just two families of functions $h(x)$ such that 
equation (15) holds: 

1) $h(x) = C x^k + C_1 x + C_2$, $k \neq 0$, $k \neq 1$, and 

2) $h(x) = C_1 x \ln x + C_2 \ln x + C_3 x + C_4$. 

From these equalities it is easy to see that all universal Lyapunov functions 
$H$ for Markov chains, for which there exists a monotonic function $F$ such that 
$F(H(p)) = F(H(q)) + F(H(r))$ if $p = (p_{ij}) = (q_i r_j)$, have the form 

$$H(p) = C \sum_i p_i^* \left( \frac{p_i}{p_i^*} \right)^k + C_1 \qquad (19)$$

where $k \neq 0$ and $k \neq 1$, $C > 0$, or the form 

$$H(p) = \sum_i p_i^* \left( C_1 \ln \frac{p_i}{p_i^*} + C_2 \frac{p_i}{p_i^*} \ln \frac{p_i}{p_i^*} \right) + C_3 \qquad (20)$$

where $C_1 \leq 0$ and $C_2 \geq 0$. 

So, there are two families of such entropies: 

1) Classes of equivalent entropies of Renyi-Tsallis (19); 

2) Classes of entropies which are monotonically equivalent to additive trace-form 
entropies (20). 
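
For the Renyi-Tsallis family (19), a monotone transformation that restores additivity can be exhibited directly in the simplest case. The sketch below assumes $C = 1$ and $C_1 = 0$, for which $H$ is multiplicative on product distributions, so $F = \ln$ works. The function name `H19` and the sample data are hypothetical.

```python
import math

def H19(p, p_star, k, C=1.0, C1=0.0):
    """Family (19): H(p) = C sum_i p_i^* (p_i / p_i^*)^k + C1."""
    return C * sum(ps * (pi / ps) ** k for pi, ps in zip(p, p_star)) + C1

def product(a, b):
    """Joint distribution of two independent subsystems."""
    return [ai * bj for ai in a for bj in b]

k = 1.8
q, q_star = [0.2, 0.5, 0.3], [0.3, 0.3, 0.4]
r, r_star = [0.6, 0.1, 0.3], [0.25, 0.25, 0.5]
p, p_star = product(q, r), product(q_star, r_star)

Hp, Hq, Hr = H19(p, p_star, k), H19(q, q_star, k), H19(r, r_star, k)
F = math.log   # a monotone transformation that restores additivity here
print(abs(F(Hp) - F(Hq) - F(Hr)) < 1e-12, abs(Hp - Hq - Hr) > 1e-3)
```

The trace-form function itself is not additive on this example, while its logarithm is.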



4 One remark about dimension 



Formally, in our problems we considered Lyapunov functions of the form 
$H(p, p^*)$ for Markov chains with any finite number of states. But in the 
proofs it is enough to use only Markov chains with a small number of states: 

1) In Lemma 1 (the transition from the function of two variables $f(p, p^*)$ to 
$p^* h\left(\frac{p}{p^*}\right)$) it is enough to consider Markov chains with two states. 

2) In Theorem 1 (the description of all additive trace-form Lyapunov 
functions for Markov chains) it is enough to consider subsystems with 3 states 
and the unification of two independent systems, each of which has three states. 

3) In the last result (the characterization of all classes of monotonically equivalent 
entropies in which there is one trace-form entropy and one additive 
entropy) it is enough to consider systems with 6 states and their unification 
(probably it is enough to consider systems with 4 states, but this is not proven). 



5 Concluding remarks 

Axiomatic characterization of entropy by its properties is a very old and 
well-investigated topic [15]. The development of nonextensive thermodynamics 
forces us to revisit old questions [16], [17]. In particular, there is the question of 
the framework within which entropies are investigated and used. 

We consider entropy as a Lyapunov function for Markov chains and as a measure 
of deviation from equilibrium. In the class of such functions we have 
found all the entropies that have the following three properties simultaneously: 
universality, trace-form and additivity. This is a family of additive 
entropies [10], [9], [18]. A weaker restriction on the entropy is that in one scale 
the entropy has the trace-form, and after a monotonic transformation of 
the scale it becomes additive (but can lose the trace-form). There is one more 
class of monotonically equivalent entropies that satisfies these restrictions: 
the class of Renyi-Tsallis entropies. 